Depopulation is a major problem in rural areas of the world. The main aim of this work is the construction of a Spatial Depopulation Risk Index (sDRI) for the 919 municipalities of Castilla-La Mancha, using geostatistical techniques and principal component analysis. The theoretical semivariogram reveals spatial dependence up to a distance of 60 kilometers. Based on this range a neighbourhood network is constructed. Then a spatial principal component analysis (sPCA) is applied to a set of demographic variables. Finally, the sDRI is designed by extracting and scaling the first principal component of the sPCA. The resulting indicator identifies the areas with depopulation risk in which counter-measures can be applied.
Did you know that some areas of Cuenca y Guadalajara have a lower population density than Siberia? Depopulation is a major problem in rural areas of Castilla-La Mancha.
Table 1 shows that 445 municipalities of the region lost more than 20% of their population, whereas only 237 municipalities improved it in the last two decades (2001-2020).
| Population Growth Rate | Number of Municipalities |
|---|---|
| loss >20% | 445 |
| loss 10-20% | 131 |
| loss 5-10% | 62 |
| loss <5% | 44 |
| gain <5% | 43 |
| gain 5-20% | 67 |
| gain >20% | 127 |
As stated in the First Law of Geography, “Everything is related to everything else, but near things are more related than distant things” (Tobler, 1970). Figure 1 stands for the methodology used in this work:
Figure 1: Methodology
\(\color{#FFC000}{\text{The range of spatial dependence is extracted from the semivariogram}}\), the heart of Geostatistics (Montero et al., 2015), which is a tool that catch the variability, hence the spatial dependence according to the distance (see Figure 1). The expression of the semivariogram \(\gamma(s_i,s_j)\) is: \[\begin{equation} \tag{1} \gamma(s_i-s_j) = \frac{1}{2}V((s_i)-Z(s_j)), \forall s_i,s_j\in D \end{equation}\] where \(s_i\) and \(s_j\) are two municipalities of domain \(D\), \(V\) is the variance, and \(Z(s)\) is the Population Growth Rate at municipality \(s\).
Figure 2: Components of a semivariogram
Based on the adjusted semivariogram, \(\color{#453780}{\text{a neighbourhood network}}\) is constructed in the form of a proximity matrix \(L\). Then a \(\color{#453780}{\text{spatial principal component analysis}}\) (Jombart, T. et al., 2008) is applied to ten demographic variables (Jato-Espino & Mayor-Vitoria, 2023): population 2001, population 2020, youth (<16 years) 2020, elder (>64 years) 2020, growth population 2001-2020, population density 2020, natural increase rate 2010-2020, ageing index 2020, dependence index 2020 and net migration rate 2010-2020.
This set of variables define a 10-dimensional space where Euclidean distances between the 919 entities are calculated.\(\color{#453780}{\text{ Spatial PCA}}\) uses the constraint that the provided scores should be spatially autocorrelated and, thus, focuses on the spatial pattern of depopulation variability.
Two types of spatial patterns are discriminated: global and local structures, corresponding respectively to large positive and large negative eigenvalues. This is accomplished by maximizing: \[\begin{equation} \tag{3} C(v) = V(Xv)I(Xv) = \frac{1}{n}(Xv)^TLXv = \frac{1}{n}v^TX^TLXv \end{equation}\] where \(V\) is the variance, \(X\) the demographic data matrix, \(I()\) the Moran’s \(I\), which catch the spatial autocorrelation, \(L\) the proximity matrix and \(v\) the scaled axes in \(R^{10}\), with \(||v||^2 = 1\).
Figure 2 shows the extreme theoretical possibilities.
The last step is \(\color{#288A8C}{\text{extracting the first principal component of the sPCA}}\), \(\color{#288A8C}{\text{scaling it from 0 to 100}}\) and \(\color{#440E53}{\text{classify the municipalities with the sDRI}}\).
Figure 3: Theoretical cases: (a) spatial dependence, (b) no spatial dependence
This work starts with the construction of a semivariogram to analyze the spatial dependence and calculate its range. As shown in Figure 4 the semivariogram is adjusted to a spherical model with the following parameters: range of 60000 meters (60 km), sill of 3419, and nugget of 1667.
Figure 4: Adjusted semivariogram
Once the range of spatial dependence is estimated to 60 km, the neighbourhood network is constructed and the spatial analysis of principal components of depopulation in Castilla-La Mancha is performed. The results of the sPCA are shown in Figure 5.
Figure 5: Principal results of spatial principal component analysis: (a) Eigenvalues of sPCA; (b) Map of sPCA scores of municipalities.
The last step of the work is to extract the first component of sPCA and scale it from 0 to 100. The resulting sDRI is used to classify the municipalities from lack of depopulation risk (sDRI = 0) to extreme risk (sDRI = 100). As shown in Figure 6-left Albacete is the municipality with absolute abscence of depopulation risk (sDRI = 0), followed by Guadalajara, Talavera de la Reina, Toledo and Azuqueca de Henares. At the other end we have Arandilla del Arroyo (sDRI = 100), followed by Alique, Valsalobre, Angón and Pineda de Cigüela.
Finally, Figure 6-right represents the sDRI in a map of municipalities of Castilla-La Mancha.
Figure 6: Depopulation Risk in municipalities of Castilla-La Mancha according to sDRI
The applied spatial principal component analysis results in a Depopulation Risk Index which identifies numerous areas as having a medium to high risk of depopulation; namely, the majority of villages of Cuenca and Guadalajara, and the west and the south of the region. Conversely, it shows no risk for the areas of La Mancha and the Sagra and Henares industrial corridors, as well as the provincial capitals, Talavera de la Reina and Puertollano (see Figure 5).
We can conclude that Spatial Principal Component Analysis can be applied to demographic variables to construct an index to classify the municipalities of Castilla-La Mancha according to their depopulation risk. The scores of sDRI can be integrated into an expert system capable of identifying the areas in which counter-measures can be applied by local and regional governments.